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Abstract 



In this paper we consider neighborhood load balancing in the context of selfish clients. We assume 
that a network of n processors and m tasks is given. The processors may have different speeds 
and the tasks may have different weights. Every task is controlled by a selfish user. The objective 



of the user is to allocate his/her task to a processor with minimum load. 



We revisit the concurrent probabilistic protocol introduced in [6], which works in sequential 
QJ ' rounds. In each round every task is allowed to query the load of one randomly chosen neighboring 

processor. If that load is smaller the task will migrate to that processor with a suitably chosen 
probability. Using techniques from spectral graph theory we obtain upper bounds on the expected 
convergence time towards approximate and exact Nash equihbria that are significantly better than 
^ the previous results in [B]. We show results for uniform tasks on non-uniform processors and the 

general case where the tasks have different weights and the machines have speeds. To the best 
of our knowledge, these are the first results for this general setting. 

^ Keywords and phrases Load balancing, reallocation, equilibrium, convergence 

^ Fi Introduction 

Load Balancing is an important aspect of massively parallel computations as it must be 
ensured that resources are used to their full efficiency. Quite often the major constraint 
on balancing schemes for large networks is the requirement of locality in the sense that 
processors have to decide if and how to balance their load with local load information only. 
Global information is often unavailable and global coordination usually very expensive and 
impractical. Protocols for load balancing should respect this locality and still guarantee fast 

^ convergence to balanced states where every processor has more or less the same load. 

• ^ In this paper we consider neighborhood load balancing in a selfish setting. We assume 

that a network of n processors and m tasks is given. The processors can have different speeds 

^ and the tasks can have different weights. Initially, each processor stores some number of 

tasks. The total number of tokens is time-invariant, i.e., neither do new tokens appear, nor 
do existing ones disappear. The load of a node at time t is the total weight of all tasks 
assigned to that node at that time. 

Every task is assumed to belong to a selfish user. The goal of the user is to allocate the 
task to a processor with minimum load. We assume neighborhood load balancing, meaning 
that task movements are restricted by the network. Users that are assigned to the processor 
represented by node v of the network are only allowed to migrate their tasks over to processors 
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that are represented by neighboring nodes of v. Hence, the network models load balancing 
restrictions. Our model can be regarded as the selfish version of diffusion load balancing. 

In this paper we revisit the concurrent probabilistic protocol introduced in 6 . The load 
balancing process works in sequential rounds. In each round every task is allowed to check 
the load of one randomly chosen neighboring processor. If that load is smaller the task will 
migrate to that processor with a suitably chosen probability. Note that, if the probability is 
too large (for example all tasks move to a neighbor with smaller load) the system would never 
be able to reach a balanced state. Here, we chose the migration probability as a function of 
the load difference of the two processors. No global information is necessary. 

Using techniques from spectral graph theory similar to those used in we can calculate 
upper bounds on the expected convergence time towards approximate and exact Nash 
equilibria that are significantly better than the previous results in [6j. We show results for 
uniform tasks on non-uniform processors and the general case where the tasks have different 
weights and the machines have speeds. To our best knowledge these are the first results for 
this general setting. For weighted tasks we deviate from the protocol for weighted tasks 
given in In our protocol, a player will move from one node to another only if the player 
with the largest weight would also do so. It is also straightforward to apply our techniques 
to discrete diffusive load balancing where each node sends the rounded expected flow of the 
randomized protocol to its neighbors ([2 ). 

1.1 Model and New Results 

The computing network is represented by an undirected graph G — {V, E) with vertices 
representing the processors and edges representing the direct communication links between 
them. The number of processors n = |V^| and the number of tasks is to. The degree of a 
vertex v &V is deg(w). The maximum degree of the network is denoted by A, and for two 
nodes v and w the maximum of deg(w) and deg(?i') is d^w 

Si e M is the speed of processor i. We assume that the speeds are scaled so that the 
smallest speed, called Smin, is 1. If all speeds are the same we say the speeds are uniform. Let 
S = 'Ylii^v total capacity of the processors. Define Smax as the maximum 

speed and Smin as the minimum speed of the processors. In the case of weighted task task £ 
has a weight wi G (0, 1]. In the case of uniform tasks we assume the weight of all tasks is 
one. Let W denote the total sum of all weights, W = Wi{x). 

A state X of the system is defined by the distribution of tasks among the processors. For 
the case of uniform indivisible tasks, we denote with Wi(x) the number of tasks on processor 
i in state x. For the case of weighted tasks, Wi{x) denotes the total weight on processor 
i whereas wi G (0, 1] denotes the weight of tasks /. The load of a processor i is defined as 
Wi{x)/si in the case of uniform tasks and as Wi{x)/si in the case of weighted tasks. The goal 
is to reach a state x in which no task can benefit from migrating to a neighboring processor. 
Such s state is called Nash Equilibrium. 

1.1.1 Uniform Tasks on Machines with Speeds 

For uniform tasks, one round of the protocol goes as follows. Every task selects a neighboring 
node uniformly at random. If migrating to that node would lower the load experienced by 
the task, the task migrates to that node with proportional to the load difference and the 
speeds of the processors. For a detailed description of the protocol see Algorithm [l] in Section 
131 
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The first result concerns convergence to an approximate Nasli equilibrium if the number 
of tasks, m, is large enough. For a detailed definition of Laplacian matrix see Section [2] 

► Theorem 1.1. Let ip^ = 167i-A-Sniax/'^2 o-T^d let A2 denote the second smallest eigenvalue of 
the network's Laplacian matrix. Then, Algorithm^ (p. ^ reaches a state x 'with'^o{x) <4:-ipc 
in expected time 

If m > 8 ■ 5 ■ Smax ■ S ■ for some 6 > I, this state is an e-approximate-Nash equilibrium 
with e = 2/{l + S). 

From the state reached in Theorem |1.1| we then go on to prove the following bound for 
convergence to a Nash equilibrium. 

► Theorem 1.2. Let ipc be defined as in Theorem \l.l\ and let T be the first time step in 
which the system is in a Nash equilibrium. Under the condition that the speeds Si are integer 
multiples of a common factor, e, it holds 



E[T] = O 



A2 s4 

^ " "max 



A 



2 



These theorems are proven in Section [3] Our bound of Theorem |1.2| is smaller by at least a 
factor of ri(A • diam(G')) than the bound found in ^ (see Observation 3.28). 

r Table 1 Comparison with existing results 
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This Paper 
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This Paper 
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We summarize the results for the most important graph classes in Table [T] The table 
gives an overview of asymptotic bounds on the expected runtime to reach an approximate or 
a exact Nash equilibrium. We omit the speeds from this table because they are independent 
of the graph structure and, therefore, the same for each column. We compare the results 
of this paper to the bounds obtained from [B] . These contain a factor S = , which we 
replace with n, using S = Si > n. The table shows that for the graph classes at hand, 
our new bounds are superior to those in |B]. 

1.1.2 Weighted Tasks on Machines with Speeds 

In Section |4] we study a slightly modified protocol (see |2j that allows tasks only to migrate 
to a neighboring processor if that would decrease their experienced load by a threshold 
depending on the speed of the processors. This protocol allows the tasks only to reach an 
approximate Nash Equilibrium. 
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► Theorem 1.3. Let ipc — ■ n ■ A/A2 ■ Smax/Smin '^'^'^ -'^2 denote the second smallest 
eigenvalue of the network's Laplacian matrix. Then, Algorithm^ (p. 11) reaches a state x 
with ^'0(2;) < A ■ %l)c in time 





A 








max I 









Under the condition that W > ■ 5 ■ Smax/smin ■ S ■ for some 5 > 1, this state is an 
2/(1 + 5) -approximate Nash equilibrium. 

For the case of uniform speeds the theorem gives a bound of O (ln(m/n) • A/A2) for the 
convergence time. 



1.1.2.1 Outline. 

After presenting the notation and prehminaries in Section [2] we treat the case of machines 
with speeds in Section [3] Section |4] treats the case of weighted tasks. Proofs are found in the 
appendix. 



1.2 Related Work 

The work closest to ours is in [U [SJ [B]. [3] considers the case of identical machines in a 
complete graph. The authors introduce a protocol similar to ours that reaches a Nash 
Equilibria (NE) in time ©(log log m + poly(n)). Note that for complete graphs the NE 
and the optima (where the load discrepancy is zero or one) are identical. An extension of 
this model to weighted tasks is studied in fS^. Their protocol converges to a NE in time 
polynomially in n, m, and the largest weight. In |B] the authors consider a model similar 
to ours, meaning general graphs with processors with speed and weighted tasks. They use 
a potential function similar to ours for the analysis. The potential drop is linked to the 
maximum load deviation, La = max^gy |ei/si|. The authors show that an edge must exist 
over which the load difference is at least La/ diam(G'). As long as the potential ^0 is large 
enough, it can then be shown that there is a multiplicative drop. This is then used to prove 
convergence to an approximate Nash equilibrium. Subsequently, a constant drop in $1 is 
used to finally converge to a Nash equilibrium. The two main results of |S] for machines with 
speeds are presented in Table [T] 

Our paper relates to a general stream of works for selfish load balancing on a complete 
graph. There is a variety of issues that have been considered, starting with seminal papers on 
algorithms and dynamics to reach NE [121 US] . More directly related are concurrent protocols 
for selfish load balancing in different contexts that allow convergence results similar to ours. 
Whereas some papers consider protocols that use some form of global information fl4! or 
coordinated migration [19j , others consider infinitesimal or splittable tasks [18. i3j or work 
without rationality assumptions |17| [T]. The machine models in these cases range from 
identical, uniformly related (linear with speeds) to unrelated machines. The latter also 
contains the case when there are access restrictions of certain agents to certain machines. 
For an overview of work on selfish load balancing see, e.g., |27j . 

Our protocol is also related to a vast amount of literature on (non-selfish) load balancing 
over networks, where results usually concern the case of identical machines and unweighted 
tasks. In expectation, our protocols mimic continuous diffusion, which has been studied 
initially in [101 E] and later, e.g., in [25] • This work established the connection between 
convergence, discrepancy, and eigenvalues of graph matrices. Closer to our paper are discrete 



Clemens Adolphs and Petra Berenbrink 



diffusion processes - prominently studied in [26 , wliere tlie autliors introduce a general 
technique to bound the load deviations between an idealized and the actual processes. 
Recently, randomized extensions of the algorithm in [25] have been considered, e.g., [T^ [2D] . 

f2 Notation and Preliminaries 

In this section we will give the more technical definitions. 

A state X of the system is defined by the distribution of tasks among the processors. For 
the case of uniform indivisible tasks, we denote with Wi{x) the number of tasks on processor 
i in state x. For the case of weighted tasks, Wi{x) denotes the total weight on processor i 
whereas wi S (0,1] denotes the weight of tasks I. 

The task vector is defined as w(x) — {wi{x),'W2{x), ■ ■ • w„(a:))^. We define the load of 
processor i in state x as ii{x) :— Wi{x)/si. In analogy to the task vector, we define the load 
vector as l{x) = {£i{x) , ■ ■ ■ £n{x))^ . For the processor speeds, we define the speed vector 
as s = (si, • • • s„)^ and the speed matrix as S G Su — Si. Let Smax ■— max^gv Si 

denote the maximum speed. The task vector w(a;) and the load vector £{x) are related by 
the speed matrix S via £{x) = S^^w{x). The average load of the network is £i — m/S. In 
the completely balanced state, each node has exactly this load. The corresponding task 
vector is w = m/5 • s and we define e(a;) of the deviation of the actual task vector from the 
average load vector, e{x) = w(x) — w. It is clear that X^igy — ^■ 

A state X of the system is called a Nash equilibrium (NE) if no single task can improve its 
perceived load by migrating to a neighboring node while all other tasks remain where they 
are, i.e., £i — ij < l/sj for all edges (i, j). A state x of the system is called an e-approximate 
Nash equilibrium (e-approximate-NE) if no single task can improve its perceived load by a 
factor of (1 — e), i.e. {1 — e) ■ it — £j < l/sj. 

The Laplacian L{G) is a matrix widely used in graph theory. It is the nx n matrix whose 
diagonal elements are La — deg(i), and the off-diagonal elements are Lij = — 1 if (i, j) e E{G) 
and otherwise. The generalized Laplacian LS~^, where S is the diagonal matrix containing 
the speeds Si [TT], is used to analyze the behavior of migration in heterogeneous networks. 

1 3 Uniform Tasks on Machines with Speeds 

The pseudo-code of our protocol is given in Algorithm [l] Recall that dij is defined as 
max{deg(i),deg(j)}. a is defined as 4smax- 
Algorithm 1: Distributed Selfish Load Balancing 
begin 

foreach task £ in parallel do 

Let i = iil) be the current machine of task I 

Choose a neighboring machine j uniformly at random 

if £i — £j > 1/sj then 

Move task i from node i to node j with probability 

degji) ^ - e, 

end 
end 
end 
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The analysis of this protocol initially follows the steps of |6j up to Lemma 3.3 (Restated 
as Lemma 4.1 in the appendix). Before we outline the remainder of our proof, we introduce 
some more notation. 

► Definition 3.1. For a given state x, we define fij{x) as the expected flow over edge 
It holds 

(^M^IM^ if ^x)-e,{x)>j, 

I otherwise. 
The following two potential functions will be used in the analysis. 

► Definition 3.2. For r = 0, 1, define 
W,{x) ■ {W,{x) + r) 



The potential $o is minimized for the average task vector, w. We define the according 
normalized potential ^!q. 



► Definition 3.3. The normalized potential 'i'o{x) is defined as 



S ^—^ Si 

We want to relate this potential function to the load imbalance in the system. To this end, 
we define a new quantity. 

► Definition 3.4. We define the maximum load difference as 



L\(x) — max 

iGV 



Si S 



= max 
iev 



► Definition 3.5. Let t > be some time step during the executing of our protocol and let X* 
denote the state of the system at that time step. We define A$,.(X*) := <^r{X*^^) — ^r{X^) 
as the drop in potential $r(X*) in time step t. The sign convention for A$r(X*) is such 
that a drop in '^r{x) from time step t — 1 to t gets a positive sign. This emphasizes that a 
large drop in ^r{x) is a desirable outcome of our process. A^o(-''^*) is defined analogously. 

► Lemma 3.6. The shifted potential ^'o(a;) has the following properties. 

(1) The change in ^o{x) due to migrating tasks is the same as the change in <i>o(x), i.e. 

A^'o(X*|X*-i =x) = A$o(X*|X*"^ = x) 

(2) The potential ^'0(2;) can also he written using the generalized dot-product introduced in 



Section 



A. 2 



*o(a;) =Eieye|M = (e,e)s 



► Definition 3.7. With fij{x) the expected flow over edge (i, j) in state x, we define the set 
of non-Nash edges as 

E{x) ■.^{{i,j)eE:f,j{x)>0}. 

This is the set of edges for which tasks have an incentive to migrate. Edges with fij{x) = 
are called Nash edges or balanced edges. 
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► Definition 3.8. As an auxiliary quantity, we define 
K^{x) {2a - 2) • d,, ■ + ^\ U^) + f " f ■ 

\ S'i Sj J Si Sj 

Our improved bound builds upon results in [B]. In that paper, the randomized process is 
analyzed by first lower-bounding the potential drop in the case that exactly the expected 
number of tasks is moved, and then by upper-bounding the variance of that process. This 
leads to Lemma |4.1[ Based on this lemma, we now prove a stronger bound on the expected 
drop in the potential 'i!o{x). Let us briefly outline the necessary steps. The lower bound 
on the drop in the potential in Lemma |4.1| is a sum over the non-Nash edges and contains 
terms of the form £i — £j , whereas the potential itself is a sum over the nodes and contains 
terms of the form if. We will use the graph's Laplacian matrix to establish a connection 
between and the expected drop in "^/q. This will allow us to prove fast convergence to a 
state where ^'o is below a certain critical value ipc- If rn is sufficiently large, this state also is 
an £-approximate Nash equilibrium. In the next stage of our approach, we use a constant 
drop in ^'i, a shifted version of $1, to prove convergence to an exact Nash equilibrium. The 
techniques from probability theory used in this this section are similar to the ones used in [3] . 



3.1 Convergence Towards an Approximate Nash Equilibrium 



To make the connection with the Laplacian, we first have to rewrite the bound in Lemma 4.1 
in the following way. 

► Lemma 3.9. Under the condition that the system is in state x, the expected drop in the 
potentials $0 <^^d, ^0 bounded by 



E[A^'o(X'=+i)|X''- = x] > J2 



{l-l)-{l,{x)-l,{x)f 



n 
a 



Next, we use various technical results from spectral graph theory to prove the following 
bound. 

► Lemma 3.10. Let L be the Laplacian of the network. Let A2 be its second smallest 
eigenvalue. Then 

E[AvI.o(X^+i)|X^- = > ^ . ^ . _ 

In a first step, we get rid of the conditioning of the potential drop on the previous state. 

► Lemma 3.11. Let 7 be defined such that 1/7== A2/(32A • s^^^^). 

Then, the expected value of the potential in time step t is at most 



As long as the expected value of the potential is sufficiently large, we can rewrite the 
potential drop as a multiplicative drop. 

► Definition 3.12. Let A2 be the second smallest eigenvalue of the Laplacian L(G) of the 
network. We define the critical value i/ic as '0c = 8 • n • A • Smax/A2. 
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► Lemma 3.13. Let t he a time step for which the expected value of the potential satisfies 
E[^'o(X*)] > t/'c- Let 7 be defined as in Lemma 3.11 Then, the expected potential in time 
step t + \ is hounded by 

E[vI/o(X*+i)] < (l-^) -Ei^I/oCX*)]. 

This immediately allows us to prove the following. 

► Lemma 3.14. For a given time step T, there either is a t < T so that 'E[^q{X*)] < ijjc, or 

T 



E[*o(X^)] < (^1 - ^) •E[v|/o(X°)]. 



Thus, as long as E[^'o(X*)] > V'c holds, the expected potential drops by a constant factor. 
This allows us to derive a bound on the time it takes until E[^'o(^*)] is small. 

► Lemma 3.15. Let T — 2j ■ ln(m/n). Then it holds 

(1) There is at<T such that E[^'o(X^)] < Vc- 

(2) There is a t <T such that the probability that ^'o(^*) < 4, ■ ipc is at least 

Pr[vI/o(X*) < 4 • Vc] > l- 

This is similar to a result in but our factor 7 is different. This is reflected in a different 
expected time needed to reach an e-approximate Nash equilibrium, as we have pointed out 
in the introduction. 

Next, we show that states with ^o{x) < 4: ■ tpc are indeed £-approximate Nash equilibria 
if the number of tasks exceeds a certain threshold. This requires one further observation. 

► Observation 3.16. For any state x, we have L/^{x)'^ < 'i'Q{x) < S ■ L/^{xY . 

► Lemma 3.17. Let m > 8-(5-n^ -^-Smax for some 5 > 1. Then a state x with ^'0(2^) ^ 4-ipc 
is a 2/(1 + S) -approximate Nash equilibrium. 

► Remark. If m is small, it still holds that we reach a state x with ^'0(2;) < 4 - -tpc, which is 
all we need to prove convergence to an exact Nash equilibrium in the next section. It is just 
that this intermediate state is then not an e-approximate-Nash equilibrium. 

Now we are ready to show Theorem |1.1| 

Theorem 11.11 Lemma [3.1 7 ensures that after T steps the probability for not having reached 
a state x with ^q{x) < 4 • ^/^c is at most 1/4. Hence, the expected number of times we have 
to repeat T steps is less than 

1 + 1/4+1/42 + ... = ^2. 

The expected time needed to reach such a state is therefore at most 2 • T with T from Lemma 
I57[5] < 

If we let the algorithm iterate until a state x with ^'0(2;) < 4 • t/jc is obtained. Theorem |1.1| 
bounds the expected number of time steps we have to perform. However, by repeating a 
suflicient number of blocks with T steps, we can obtain arbitrary high probability. 

► Corollary 3.18. After c-log^^n many blocks of size T, a state with 'i!Q{x) <4:-ipc is reached 
with probability at least 1 — l/n'^. 

Corollary |3.18[ The probability for not reaching a state x with ^o(a^) ^ 4 ■ i/jc after k steps 
is at most 1/4*"'. We are interested in the complementary event, so its probability is at least 
1 — 1/4*^. For k — c ■ log4 n the statement follows immediately. M 
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3.2 Convergence Towards a Nash Equilibrium 

We now prove the upper bound for the expected time necessary to reach an exact Nash 
Equihbrium (Theorem 1.2 p. |3|. To show this result, we have to impose a certain condition 
on the speeds. If the speeds are arbitrary non-intcgcrs, convergence can become arbitrarily 
slow. Therefore, we assume that there exists a common factor e S (0, 1] so that for every 
speed Si there exists an integer S N so that Si — rii ■ e. We call e the granularity of the 
speed distribution. The convergence factor a, which was 4smax in the original protocol, must 
be changed to 4si„ax/e- For non- integer speeds, we have e < 1, so this effectively increases a. 

To show convergence towards an exact Nash Equilibrium we cannot rely solely on the 
potential ^'o(x), because when the system is close to a Nash equilibrium it is possible that 
the potential function increases even when a task makes a move that improves its perceived 
load. Therefore, we now look at potential ^i{x). 

► Definition 3.19. We define the shifted potential function 



^i{x) = $i(x) 



m 
~S 



S 



n 

45 



-E- 



Let Sa and denote the arithmetic mean and the harmonic mean of the speeds, i.e. 

Sa = HidV Si/n and Sh = n/Y.iev V^i- 
Then, we can write 



^i{x) = $i(x) - 



S 



S 



n 
4 



1 

Sh 



1 



► Observation 3.20. The shifted potential ^i(a;) has the following properties. 
(1) Let e = w — w be the task deviation vector. Then 



n 

4Sa' 



(2) ^i{x)>Q. 

3 ^>,{x) = vl/o X +E- + 7- ^- ^ 



(4) A«'i(X*) = A$i(X*). 



Sh 



Before we can lower-bound the expected drop in 4*1 (a;), we need a technical lemma 
regarding a lower bound to the load difference. It is similar to [6, Lemma 3.7], which 
concerned integer speeds, so the result here is more general. 

► Lemma 3.21. Every edge {i,j) with £i — £j > l/sj also satisfies 



Potential differs from potential defined in [3] by a constant only. Therefore, potential 
differences are the same for both potentials and we can apply results for to "^i. 

► Lemma 3.22. // the system is in a state x that is not a Nash equilibrium, then 
E[A^i{X''+^)\X'' = x]> 
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Since the results of the previous section apply to whereas now we work with ^1, we add 
this technical lemma relating the two. 



► Lemma 3.23. For any state x it holds 



Tl TL 



1 

Sh 



To obtain a bound on the expected time the system needs to reach the NE, we use a standard 
argument from martingale theory. Let us abbreviate V :— e^/(8A • s^jj^x)- We introduce a 
new random variable Zt which we define as Zt = + t ■ V. 

► Lemma 3.24. Let T be the first time step for which the system is in a Nash equilibrium. 
Then, for all times t < T we have 

(1) ^[Zt\Zt-i ^ z] < z 

(2) E[Zt] <E[Zt_i]. 

► Corollary 3.25. Let T be the first time step for which the system is in a Nash equilibrium. 
Let t AT be defined as min{t,r}. Then the random variable Zti\T is a super-martingale. 

► Corollary 3.26. Let T be the first time step for which the system is in a Nash equilibrium. 
Then E[Zt] < Zo = 



Now we are ready to show Theorem |1.2| 

Theorem 11.21 First, we assume that at time t = the system is in a state with E[\E'o(-'^"^)] < 
4 • tpc. Using the non-negativity of ^'1(2;) (Observation 3.201 allows us to state 

V ■ E[r] < E[«'i(X^)] + V ■ E[T] = E[Zt] 
(Cor.IMll) < E[Zo] = 

n n 
Sh 4 
n n 
^+4 



1 

Sh 



(Lem.[32i < «'o(^°) + y'*o(^°) 
< 4 • Vc + ^J4^. 
Inserting the definition of a-nd dividing by V yields 



(Lem. [LT} < 8 A • 



A2 

64 • n • A • Si, 

To 



4- 



16 • ri • A • s„ 



A2 



2A 



< 512- A^ . 



. ^ + 91 . A2 . _ 

e A2 e 

„4 



4 

max 



n 

^ + i 
4 



-4- A^ 



A2 



where we have used that 2A/A2 > 1 (Lemma 1.7 1 to pull that expression outside of the 
square root in the first line. 

This bound was derived under the assumption that at i = wc had a state with 
E[^'o(X*)] 4 • ■0c- If this is not the case, let r denote the number of time steps to reach 
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such a state, and let T' denote the additional number of time steps to reach a NE from there. 
Combining the result from above with Theorem |1.1| allows us to write 



E[T] = E[r + T'] = O ( — • A 



n . 9 s 



4 



2 ''max 



► Corollary 3.27. Similarly to Corollary 3.18: after c- log^n blocks of T steps we have 



reached a Nash Equilibrium with probability at least 1 — 



► Observation 3.28. Our bound in Theorem |1. 2 1 is asymptotically lower than the corresponding 
bound in [6| by at least a factor of (A • diam(G')). 



Proof. Lemma 1.5 yields n ■ diam(G) > 4/A2. Additionally, we have S > Smax, since Smax 
occurs (at least once) in the sum of all speeds. Hence, the asymptotic bound from [;6, is 
larger than 

(" • ^ • slax • [A • diam(G)] 

The first part of this expression is the bound of Theorem |1.2[ so the expression in the square 
brackets is the additional factor of the bound from [B]. 

1 4 Weighted Tasks 

The set of tasks assigned to node i is called x{i). The weight of node i becomes Wi{x) = 
J2e£x{i) whereas the corresponding load is defined as £i{x) = Wi{x)/si. 

We present a protocol for weighted tasks that differs from the one described in [B]. It is 
presented in Algorithm |2] 

Algorithm 2: Distributed Selfish Load Balancing for weighted tasks 
begin 

foreach task I in parallel do 

Let i = i(X) be the current machine of task i 
Choose a neighboring machine j uniformly at random 
if ~ij > j- then 

Move task £ from node i to node j with probability 

_ deg(i) - W, 

Pij 



end 
end 
end 



The notable difference to the scheme in jfjj is that in our case, the decision of a task £ 
to migrate or not does not depend on that task's weight. In the original protocol, a load 
difference of more than Wi/sj would suffice for task £ to have an incentive to migrate. In the 
modified protocol, a task will only move if the load difference is at least l/sj. The advantage 
of this approach is that for an edge either all or none of the tasks on node i have an 
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incentive to migrate. This greatly simplifies the analysis. We will show that the system 
rapidly converges to a state where £i — £j < l/sj for all edges (i, j). Such a system is not 
necessarily a Nash equilibrium as £{ — £j might still be larger than the size of a given task 
We. We will show, however, that such a state is an e-approximate NE. 

► Definition 4.1. In analogy to the unweighted case, we define the expected flow fij as the 
expected weight of the tasks migrating from i to j in state x. It is given by 

e,{x) - i,{x) ^ _ i,{x)~i,{x) 



hi^) = — 7^ y . 



a ■ di j 



The potentials $0 and $1 are defined analogously to the unweighted case. Here, we 
concentrate on $0 alone. The average weight per node is W/n and the task deviation is 
defined as Wj — W/n. We define ^"0(2;) in analogy to the unweighted case as the normalized 
version of $0, 



The auxiliary quantity Aij(x) is defined analogously to the unweighted case as 
A,j{x) = (2a - 2) • ■(- + -)■ 

4.1 Convergence Towards an Approximate Nash Equilibrium 

In close analogy to jSl Lemma 3.1], we first bound the drop of the potential when the fiow is 
exactly the expected flow. 

► Lemma 4.2. The drop in potential $0 system is in state x and if the flow is exactly 

the expected flow is bounded by 

A<I>o(X*+1|X*=.t)> V -A, 



The proof is formally equivalent to the one in jB] and therefore omitted here. Next, we bound 
the variance of the process. 

► Lemma 4.3. The variances of the weights on the nodes are bounded via 

Sj. S-{ 



This allows us to formulate a bound on the expected potential drop in analogy to [S] Lemma 



3.3] by combining Lemma 4.2 and Lemma 4.3 



► Lemma 4.4. The expected drop in potential $0 if the system is in state x is at least 
i?[Aa>o(X*)|X*-i=x] > /.j(a;)-(A.,(x)-2). 

The proof is analogous to the corresponding lemma in [B]. 

Theorem 11.31 The rest of the proof is the same as the proof for the unweighted case. One 
may verify that, indeed. Lemma [3. 9 1 and all subsequent results do not rely on the speciflc form 
of ii or the underlying nature of the tasks. Using the same eigenvalue techniques as in the 
unweighted case, this allows us to obtain a bound involving the second smallest eigenvalue of 
the graph's Laplacian matrix. Following the steps of the unweighted case allows us to prove 
the main result of this section. A 
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A Spectral Graph Theory 

In this appendix, we will briefly summarize some important theorems of spectral graph 
theory. For an excellent introduction, we recommend the book by Fan Chung [S]. Many 
important results are collected in an overview article by Mohar 

Results in this section are, unless indicated otherwise, taken from these sources. Let us 
begin by defining the matrix we are interested in. 

► Definition 1.1. Let G = {V,E) be an undirected graph with vertices V — {1, . . . n} and 

edges E. 

The Laplacian L(G) of G is defined as 





deg(i) i = j 



otherwise. 
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The following Lemma summarizes some basic properties of L{G) and, therefore, also of L{G). 
These properties are found in every introduction to spectral graph theory. 

► Lemma 1.2. Let L{G) he the Laplacian of a graph G. For brevity, we omit the argument 
G in the following. Then, L satisfies the following. 

(1) For every vector x G M" we have 

X X/X — ^ ^ ' ^ij ' '^j ^ ^ ^ij ' i^^i -^J ) 

(2) L is symmetric positive semi- definite, i.e., — L and x^Lx > for every vector x. 

(3) Each column (row) of L sums to 0. 



A.l Spectral Analysis 

We now turn our attention to the spectrum of the Laplacian. 



► Definition 1.3. Let L{G) be the Laplacian of a graph G. Lemma 1.2 and the spectral 
theorem of linear algebra ensure that L has an orthogonal eigenbasis, i.e. there are n (not 
necessarily distinct) eigenvalues with n linearly independent eigenvectors which can be chosen 
to be mutually orthogonal. 

We call the eigenvalues of L{G) the Laplacian spectrum of G and write 

A(G) = (Ai < A2 • • • < A„) 

where the Ai are the eigenvalues of L(G'). 

The corresponding eigenvectors are denoted v^. 

The Laplacian spectrum of G contains valuable information about G. Some very basic results 
are given in the next Lemma. 

► Lemma 1.4. Let G he a graph with Laplacian spectrum A(G). For a graph G = {V,E) the 
following holds for hoth the unweighted and the weighted spectrum. 

(1) The vector 1 := (1, • • • , 1)^ is eigenvector to L and L with eigenvalue 0. Hence, Ai = 
is always the smallest eigenvalue of any Laplacian. 

(2) The multiplicity of the eigenvalue is equal to the numher of connected components of G. 
Ln particular, a connected graph has Ai = and A2 > 0. 

The second-smallest eigenvalue A2 is closely related to the connectivity properties of G. It 
was therefore called algehraic connectivity when it was first intensely studied by Fiedler |16j . 
The eigenvector corresponding to A2 is also called Fiedler vector. A first, albeit weak, result 
is the preceding lemma. A stronger result with a corollary useful for simple estimates is 
given in the next lemma. 

► Lemma 1.5 ([23J). Let A2 he the second- smallest eigenvalue of the unweighted Laplacian 
of a graph G. Let diam(G') he the diameter of graph G. Then 

diam(G) > 



• A2' 

4 

► Corollary 1.6. Using diam(G) < n, we get A2 > ^ 
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► Lemma 1.7. This is another useful result by Fiedler fl6f . Let A2 be the second-smallest 
eigenvalue of L(G). Then, 



A, < 



- ■ min{deg(«), z € V}. 



n 

For A the maximum degree of graph G, it immediately follows 

A2 < --A. 

n — I 

A stronger relationship between A2 and the network's connectivity properties is provided 
via the graph's Cheeger constant. 

► Definition 1.8. Let G = {V, E) be a graph and S (ZV a, subset of the nodes. The boundary 
5S of S is defined as the set of edges having exactly one endpoint in S, i.e., 

5S={{i,j)&E\i&S,jeV\S}. 

► Definition 1.9. Let G = (V, E) be a graph. The isoperimetric number i{G) of G is defined 
as 

i{G)^ mm . 
scv \S\ 

|S|<|V|/2 

It is also called Cheeger constant of the graph. 

The isoperimetric number of a graph is a measure of how well any subset of the graph is 
connected to the rest of the graph. Graphs with a high Cheeger constant are also called 
expanders. The following was proven by Mohar. 

► Lemma 1.10 ([22]). Let A2 be the second- smallest eigenvalue of L{G), and let i{G) be the 
isoperimetric number of G . Then, 

< A2 < 2^{G). 

This concludes our introduction to spectral graph theory, which suffices for the analysis 
of identical machines. For machines with speeds, it turns out that a generalized Laplacian is 
a more expressive quantity. 

A. 2 Generalized Laplacian Analysis 

Recall the speed-matrix S from the introduction. Instead of analyzing the Laplacian L, we 
are now interested in the generalized Laplacian, defined as LS^^. This definition is also used 
by Elsasser in [TT] in the analysis of continuous diffusive load balancing in heterogeneous 
networks. In this reference, the authors prove a variety of results for the generalized Laplacian, 
which we restate here in a slightly different language. 

It turns out that in the discussion of the properties of this generalized Laplacian, many 
results carry over from the analysis of the normal Laplacian. The similarity is made manifest 
by the introduction of a generalized dot-product. 

► Definition 1.11. For vectors x,y e M", we define the generalized dot-product with respect 
to S as 



Si 
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► Lemma 1.12. The vector space M" together with {■,-)s forms an inner product space. 
This means that 

(1) (x,y>s = (y,x)5, i.e., {■,-)s is symmetric, 

(2) (axi + 6x2, y)s = y)s + 6(x2; y)s for any scalars a and b, i.e., (•, •)s is linear in its 
first argument, 

(3) (x,x)5 > 0, with equality if and only if x = 0. i.e., (•, ■)s is positive definite. 

Proof. All three properties follow immediately from Definition provided the Si are 

positive, which is true in our case. -4 

► Remark. The fact that (•, ■)s is an inner product allows us to directly apply many results 
of linear algebra to it. For example, all inner products satisfy the Cauchy-Schwarz inequality, 
i.e., 

(x,y>| < (x,x)5 • (y,y)s. 

A proof of this important inequality can be found in every introductory book on Linear 
Algebra. 

Another concept is that of orthogonality. Two vectors x and y are called orthogonal to 
each other, x±y, if x • y = 0. Analogously, we call x and y orthogonal with respect to S if 
(x,y)5 = 0. 

Let us now collect some of the properties of LS~^. These properties have also been used in 
PT| . We restate them here using the notation of the generalized dot product. 

► Lemma 1.13. (Compare Lemma 1 in ^111) Let L he the Laplacian of a graph, and let S 
be the speed-matrix, S — diag(si, • • • , s„). Then the following holds true for the generalized 
Laplacian LS~^. 

(1) The speed-vector s = (si,...,s„)^ is (right-) eigenvector to LS~^ with eigenvalue 0. 

(2) LS^^ is not symmetric any more. It is, however, still positive semi-definite. 

(3) Since LS^^ is not symmetric, we have to distinguish left- and right-eigenvectors. Similar 
to the spectral theorem of linear algebra, we can find a basis of right-eigenvectors of LS^^ 
that are orthogonal with respect to S. 

Proof. (1) 

LS^^s = LI = 



via Lemma 1.4 For (2) and (3), suppose that x is a right-eigenvector of LS ^ with eigenvalue 



A. If we define y :~ S ^^^x, then we have 

LS-^x = Ax 
LS-^/^y = A^i/^y 
^ S-^'^LS-^'^y = Ay. 

This proves that x is right-eigenvector to LS^'^ with eigenvalue A if and only if S'^^/^x is 
eigenvector to S^^^^LS^^^^ with eigenvalue A. The latter matrix is positive definite, because 
for every vector x, we have 

x'^S-'/^LS-'^^x = (S-'/^x)'^ L{S-'/^x) > 

since L itself is positive semi-definite. Now, since S^^^^LS^^^^ is symmetric positive semi- 
definite, all its eigenvalues are real and non-negative and it possesses an orthogonal eigenbasis. 



18 



Distributed Selfish Load Balancing with Weights and Speeds 



Let us denote the n vectors of the eigenbasis with y*^, k ~ 1 . . . n. As we have just shown, 
this imphes that the vectors x''" — S^^^y'^ are right-eigenvectors to LS^^ . Since 5^/^ is a 
matrix of fuU rank, the x'" form a basis as well. Their orthogonality with respect to 5 follows 
from 

(x^x')5 = (x'=)^5-lx' 

-(y')^-y'=0 forfc^L 



For arbitrary vectors, we know that (x,_L5^^x)5 > since S^^LS^^ is positive semi-definite. 
The next lemma bounds the generalized dot product of certain vectors with the Laplacian 
with the second smallest right-eigenvector of it. A similar version can also be found in fTTJ 
Section 3]. 

► Lemma 1.14. Let A2 denote the second-smallest right-eigenvalue of the generalized Lapla- 
cian, LS^^ . Let e be a vector that is orthogonal to the speed vector with respect to S, i.e. 
(e, s)s = 0. Then 

{e,LS-^e)s > A2(e,e>5. 

Proof. Let (Afc,Vfc) denote the fc-smallest eigenvalue and corresponding eigenvector of LS^^. 
For the speed vector, s, we have LS~^s — (Lemma 1.131. Thus, we can just identify vi = s. 
Recall from Lemma [l . 13| that the form a basis of K". Therefore, e can be written as a 
linear combination of these eigenvectors. For some real- valued coefficients /3k, we have 



e = 

fc=i 

Since the basis vectors are mutually orthogonal with respect to S, and since e is orthogonal 
to s with respect to S*, s vi does not contribute to the linear combination of e, because 

= (e,vi)s = /3i(vi,vi). 

This can only be satisfied if either all speeds are zero or if f3i — 0. Therefore, we can write 



k=2 

Substituting this decomposition into the Bound of Lemma |1.14| yields 

n n 

(e, LS-^ = Y.^^- 

n 

= J2'^''' i^kWk,^k)s 

k=2 

n 

- -^2 • Y{l3kVk,l3k^k)s = A2 • (e,e)s. 



k=2 k=2 



k=2 
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The next technical lemma is needed to relate the spectra of L and LS^^ . We require this 
relation because most of the useful results and bounds for A2 apply to the normal Laplacian 
only. 

► Lemma 1.15. Let fii denote the eigenvalues of LS~^ in ascending order and let \i denote 
the eigenvalues of L in ascending order. Finally, let Si denote the speeds in descending order. 
Then 

fJ.i+j-1 > — 0<i,j<n, 0<i+j-l<n (1) 

fJ.^+j-n < — < i,j < n, < i+ j ~ n < n. (2) 

Lemma [1.151 The matrices L and S^^ are symmetric positive semi-definite. Hence, their 
square-roots exist and are unique. Let X — VL and T = V S~^. The singular values of X 

are y/JIl and those of T are \J~sY^ . In addition, the singular values of XT are \/Ai. 

By Theorem 4.3 there exist symmetric matrices Hi with eigenvalues log V/i^ and H2 



with eigenvalues log \J s^^ , and the eigenvalues of Hi + H2 are log By Theorem 
these satisfy the inequalities 



4.2 



log ^JXi+j-l > log ^ + log J s. ^ =\ogJ fi,s^ ^ 



log \/>^t+j-n < log y/jM + log y Sj ^ = log ^ ^1^SJ ^ 

Since both the logarithm and the square-root are monotone functions, the desired result 
follows immediately. < 

► Corollary 1.16. Let /i2 denote the second smallest right eigenvalue of LS^^ and let A2 
denote the second smallest eigenvalue of L. Let Smax — Si be the largest speed and Smin = s„ 
the smallest speed. Then 

^2 , ^ -^2 

< < • 

-^max '^min 

Proof. Let i = 2, j = 1 in ([T| and i ^ 2,j = n in ([2|. A 

Proofs from Section [s] 

Lemma [3.61 (1) By definition, ^"0(2;) and ^o{x) differ by m? /S. The total number of tasks, 
m, and the sum of all speeds, S , are constants. Therefore, the difference between ^'0(2;) and 
'^(){x) is constant at any time and we have 



S 
=0 



Hence, both the original and the shifted potential have the same potential drop. 

(2) This follows immediately from Definition 1.11 of the generalized dot-product. < 
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B.l Proofs from Section [3711 

Lemma 13.91 For brevity, we omit the argument x from all quantities. Note that we can look 
at the drop of either $o or '^q. Substituting the particular forms of fij and A^^ (Definitions 



3.1 and 3.8 1 into the bound provided by Lemma 4.1 of Lemma 3.3 in we arrive at 



(*) 



We define subsets E, Ei and E2 of E, 

E=l{ij)eE\£,-e,>- 



Ei^\{i,j)eE\£,^£j > - + - 

Si S rj 



E2 — E \ El. 

Note that E — EiU E2 and Ei (1 E2 = 9- Thus, we can split the sum in Q into a sum over 
El and a sum over i?2. We will now bound these sums individually. 

Let (i, j) G El be an edge in Ei so that ii > Ij. Then the definition of Ei and the 
non-negativity of £i — £j allows us to deduce 



£i - Ij > — + — ^ 



This allows us to bound 



E 

(*j)e-Ei 



a ■ d. 



1,3 



> 



E 



1- 



(*) 



Next, we turn to E2 and bound 

(2-f).(£,-^,)2 (£,-£,) 



E 

(»j)e-E2 



■ d 



The sum is over two terms, a positive and a negative one. For the first, positive term, we 
simply bound 



E 

(»j)e-E2 



a ■ 








dij 


ii- 











^ E 



For the edges in E2, we have £i — ij < 1/si + 1/sj. This allows us to bound the second, 
negative term via 



E 

(*j)e-E2 



- I, 1 



• d. 



E 



1/1 1 



(*j)6-B2 



I, J 
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Combining ([*]) , ([**]) and ( * * * I yields 



E[A^'o(X'=+i)|X'^- = x] > J2 



> 



E 



• di 



E 

(»j)e-E2 



a • di 



1 1 

— + — 

S i S 



(t) 



In the next step, we rewrite the sum over in (jfj to a sum over all edges E, using 
E — E \ {E \ E) . It generally holds for any terms ^(i.j) that 

E = E ^(^.i) ~ E 

(ij)e-B {i,j)eE\E 

We will apply this to Q. In the following, we therefore prove an upper bound on the sum 
over E\E. Without loss of generality, let the nodes i and j of an edge be ordered such that 
£i > ij. For edges not in E, we have, by definition, < li ~ £j < ^, so this part can be 
bound by 



E 



{i,j)&E\E 



a ■ di 



1 , i\ ^ ^' - ^ a-( 



{t,j)eE\E 

^ E 

(t.j)eE\E 

- E 

(t.j)eE\E 

^ E 

(t.j)eE\E 



1 


a 


■ 




1 


a 


• dij 




1 


a 


■ dij 




1 


a 


■ dij 



S q I ^ '] ^ 1 



1 1 



This bound has the same form as the bound in only that it goes over E\E instead of 

i?2- These two sets are disjunct, since E2 C E. Therefore, we can combine the two sums into 
a single sum over E2 ^ (E \ E) = E\Ei. We then obtain from the following bound from (jfj). 



(i,i)eE 

- E 

{i,3)eE\Ei 



■ d, 



a ■ di 



1 1 

— H 

Si Sj 



(tt) 



The first term of this bound already has the desired form. We will now bound the second 
term. Since it is negative, we have to upper bound the sum itself. First, note that E\ Ei is 
a subset of E. As the term inside the sum is non-negative, we can write 



22 
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Recall that dij is defined as max{deg(i), deg(j)}, so we can bound 



a ^ 



< 



a ^ 

a ^ Si a 



1 



1 



deg(i) • s, deg(j) • Sj 
1 



a ^ — ' ^ — ' deg(z) • Si 



Inserting this bound into (ff I yields the result. < 

Lemma [3.101 We start from the bound obtained in Lemma [3^9] In the course of this proof, 
we will use Lemma |1.14| for the task deviation vector e. In order to use this lemma, we have 
to show that e satisfies the lemma's condition, i.e., (e,s)5 = 0. This follows via 

(e, s)s = = ^ e, = 0. 

i£V * iev 

We can now begin with the main proof. 
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Lemma fS-lll First, we find a bound for E[4'o(^*)|X* ^ = x], i.e., the expected value of 
the potential if the previous state was x. There, we have 

E[^o{X')\X'-' = x] = vI/o(x) - E[AvI/o(X*)|X*-i = x] 



fLem. lQj)) < ( 1 - - ) • ^'o(a;) + 



4 • Sn 



Using this, we now use the tower property of iterated expectation to obtain 
E[^'o(X*)] = E[E[«'o(X*)|X*-i]] 



< 1- 



4 • Sn 
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Lemma [3.131 We can rearrange the condition E[^'o(X*)] > if^c as follows. 

8 • n • A • Smax 



E[«'o(^*)] > 



A 



2 



32 A s2 L A-s,, 



= 1/7 

We insert this into the bound from Lemma 13.111 to obtain 



7/ 4 • s 



max 



7/ 7 



Lemma 13.141 The proof is by induction on t, with t = as the base case. There, we have 

mo{X^)]< (^l-^y •E[v|/o(X")]. 

Next, let the claim be true for a given t. Either there is a t' < t such that E[^'o(X* )] < ipc 
and we are done. Otherwise, we can apply Lemma |3.13| to obtain. 

mo{X'+')] < (^1 - • E[vI/o(X*)] 

(Induction Hypothesis) <(l j •E[^'o(X")]. 



Lemma [3.151 (1) We write down the inequality we want to prove and then show that the 
T given in the lemma makes it true. Since we use Lemma |3.14| to connect the expectation 
value of ^'o at time T to its value at time t — 0, we first note that 

*o(^°) < m^ (*) 

as the largest potential is obtained by the largest imbalance, i.e., assigning all m tasks to the 
slowest node. Then, we can deduce 

n^^oix^)] < 



A 



2 



T ■ ln(l - 1/7) + In (m^) < In ( ^^^ ' ' '■""" ^ 



Next we use Lemma 1.7, which states A2 < n/{n — 1) • A, together with 71 — 1 > n/2 for 
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n > 2 to further rewrite the bound. 

T ■ ln(l - 1/7) + hi (to2) < In |^ 16A.n. 

(Lem.lLTl ^ • r + 21n(m) < In (16(n - 1)) + ln(s„iax) 

7 

^ -7 + 210(771) < ln(8n) 

7 

-- -7 + 2111(777,) < ln(n) 

7 

This can be rearranged to yield the condition on T, 



T>27-ln(-] 



n J 



(2) To prove a lower bound on the probability that ^'o(^*) < 4 - ipc, we prove an upper 
bound on the complementary event. Let Y denote the random variable \E'o(X*), with the t 
from part (1) of this lemma, i.e. the t for which E[^'o(^*)] < tAc- Then, we have 

Pr[^'o(X*) > 4 - Vc] < Pr [^'o(^*) > 4 - E[«'o(X*)]] . 

Applying Markov's inequality to this result immediately yields 

Pr[^'o(X*) >4-E[^'o(X*)]] < J. 

Hence, 

Pr[vI/o(X*)<4-^e]>l- J = 



Observation 13.161 We omit the argument x for brevity. Let us begin with the first inequality. 
Let k be the index of the node for which |efe/sfc| is maximized. Then 

p2 2 



The second inequality follows from 



■iey iev iev 



Lemma [3.171 From Observation 3.16 we have i^(x) < ^'0(2;). Hence, we have for all states 
with ^'o(a;) < 4 - -0^: 



/77 - A 2 

La < 16 - W — - s„iax < 8 - 77 - Smax =: a 



where Corollary 1.6 states that I/A2 < 77^/4. 
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Next, we define e = 2/(l + (5). The condition for an e-approximate Nash equiUbrium is 
that for every edge £ E we have 

Si Sj 

To prove that this is the case, note that the definition of La ensures that 



m 
5 



< La 



-<La 



< a- 



TO 
5 



and, analogously 



Wi TO 

Si 6 



With this, we have 



< 



1 / TO\ /TO \ 



s + 

J. mm 

<^ m>6-a-S — 8-S - Smax ■ S ■ n^. 

Thus, if the number of tasks is sufficiently high, we have an e-approximate Nash equilibrium. 



B.2 Proofs from Section O 

Observation 13.201 For brevity, we omit the argument x. We begin with (1). This is simple 
algebra. 
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4- Sa 



Next, we prove (2). From the form in (1) it might appear that \E'i(a;) can be negative. 
Note, however, that the task deviation vector e satisfies = 0- We can use the technique 

of Lagrange multiplicators to find the minimum of ^'i(x) under the constraint that the 
deviations sum to 0. For an additional parameter A, the so called Lagmnge multiplier, we 
define the Lagrange function 

£(ei, . . . , e„; A) = ^ - A • ^ e,. 
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The constrained minimum of ^'i(a;) is obtained for the solution of 

Carrying out the calculation shows that, indeed, minimum value of \E'i therefore is 0. 
(3) is obtained by first rewriting $1 as 

Recall the definition of ^'o = ^"0 ~ m? /S. Hence 
m? m ■ n n f \ 1 

S S A \Sh Sa 

^ ph Tn'^ n / 1 1 

° '—f.Si S 4 V S/i Sa 

v-^ Ci n f 1 1 



(4) follows from the definition of by observing that apart from the term $i(x), 

everything else is constant. 



Lemma 13.211 We have 

Wi ■ Sj — Wj ■ Si > Si 

<^ Wi ■ Uj ■ e — Wj ■ rii ■ e > rii ■ e 
Wi • rij — Wj ■ rii > rii 

Since the lefthand side and righthand side expressions are integers, this leads to 



•O- Wi ■ rij — Wj ■ Hi > Hi + 1 

Wi ■ Sj — Wj ■ Si > Si + e 

Sj Si ' Sj 
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Lemma [3.221 Apply to 4.1 then bound the result. 
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Lemma [3.231 Observation |3.20" (3) states that with e denoting the task deviation vector 
and 1 denoting the vector (1, . . . , 1)^, we have 



^'i(a;) = *o(a;) + (e,l)s + 



n / 1 1 



4^ \Sh Sa 



It remains to bound the dot-product in the equation above. Since (•, -js is an inner product, 
it obeys the Cauchy-Schwarz inequality and we have 



|(e, l)s|2 < (e, e)s ■ (1, l)s - ^o{x) • ^ - = ^o{x) ■ 



n 

Sh 



Lemma 13.241 The first part is obtained from simply inserting the definition. Since for times 
t <T the system is not in a Nash equilibrium, we can use Corollary |3.22| to write 

E[Zt\Zt-i = z] = E[«'i(X*)|^'i(X*-i) + it- l)V = z] 

< - V + tV = z - {t - l)V - V + tV = z. 

For the second part, note that 



Corollary |3.25[ For time steps t < T, we have t /\T — t and we can just use Lemma [3.24| 

Note that the expected constant drop in potential ^'i only depends on the current value of 
the potential. Hence 



E[Zt\Zo, Zi,... Zt^i] = E[Zt;xT|^t-i ^z]<z. 
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For time steps t > T, we have 

^[Zt\Zo, Zi, . . . Zt-i] = F,[Zt\Zo, . . . Zt-i] = Ei[Zt\Zt^i = z] < z. 

Corollary |3.26[ First, note that the random variable T is a stopping time for ^(aT; because 
the event that T = t for some time step t depends only on the state X* and, most importantly, 
does not depend on some X* for t > T. The Optional Stopping Theorem allows us to obtain 
the claim of the corollary for a stopping time T if E[T] < cxi and E[|Zf+i/\T — ZiatD < c for 
some constant c. The first condition follows from the constant drop in the potential as 
long as the system is not in a Nash equilibrium. The second condition follows from 

E[\Zt+iAT ~ Zt^rW < |A*i(X*+i)| <max«'i(x) + y. 

X 

For any given system, this expression is clearly a constant. The Optional Stopping Theorem 
for super-martingales now states that 

E[Zt] = E[Ztat] < E[Zo] = 

^C" Proofs from Section [4] 

Lemma 14.31 As in the original reference, we introduce random variables Ai and Ci for the 
tasks abandoning and coming to node i, but now they count the weight of these tasks, not only 
their number. For the Ci, we again split it into Zji where Zji counts the weight migrating 
from j to i. Then 

Var[a;]= Var[Z,,]. 

Var[Zj,] - ^ Var[w;f]. 

Here w-'/ is the random variable that is we if task £ moves from j to i and otherwise. This 
variable follows a Bernoulli distribution. If p is the probability for the event to occur and if 
X is the value of the event, then the variance is 

Var[Ber(x,p)] = • p • (1 - p) < x'^p. 

This allows us to write 

Var[Z,,] = ^VarK] 
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where we use that w'^ < we since all tasks have weight at most 1. Hence 



Similarly, we define the random variable An that is wi if task H. abandons node i and 
otherwise. It is also Bernoulli-distributed. 



Ber wi; ^ 



< E E^^ 
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When we add the variance of Ci and Ai and sum over all nodes, we get, in formal analogy to 
the unweighted case. 
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Fd Auxiliary Results 

In this part we collect results from other papers that are essential in our own proofs. 
► Lemma 4.1 (Lemma 3.3 in [6J). For any step t and any state x, 



{i,3)eEix) 



AUx) 



1 1 



► Theorem 4.2 (|28|. jT]). Let A, B and C — A-\- B he Hermitian matrices with eigenvalues 
ai, Pi and "fi in ascending order. Then 

li+]-i >ai + (3j 

li+j-n < + Pj 

► Theorem 4.3 (Theorem B from [21]). The following conditions are equivalent 

1. There exist matrices Ai e GL(n,M) with given singular spectra 

ai = a{Ai) and a — a{AiA2 ■ ■ ■ A^^). 

2. There exist symmetric n x n-matrices Hi with spectra 

X{Hi) = log ai and X{Hi + H2 + ■ ■ ■ + H^) = log a, 

that is, the eigenvalues of Hi are the logarithms of the singular values of Ai, and the 
eigenvalues of^^Hi are the logarithms of the singular values ofYliAi. 



